Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
El | 1181 | 104 | 7 | 14.8571 |
En | 455 | 28 | 2 | 14.0000 |
euros | 111 | 9 | 1 | 9.0000 |
La | 867 | 67 | 8 | 8.3750 |
según | 136 | 8 | 1 | 8.0000 |
los | 2882 | 245 | 33 | 7.4242 |
A | 167 | 14 | 2 | 7.0000 |
Por | 165 | 14 | 2 | 7.0000 |
ni | 113 | 7 | 1 | 7.0000 |
pero | 314 | 14 | 2 | 7.0000 |
la | 6577 | 574 | 85 | 6.7529 |
las | 1671 | 159 | 26 | 6.1154 |
Y | 172 | 6 | 1 | 6.0000 |
tras | 122 | 6 | 1 | 6.0000 |
Según | 66 | 6 | 1 | 6.0000 |
Los | 351 | 23 | 4 | 5.7500 |
No | 184 | 17 | 3 | 5.6667 |
durante | 133 | 11 | 2 | 5.5000 |
dólares | 45 | 5 | 1 | 5.0000 |
Para | 83 | 5 | 1 | 5.0000 |

Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
millones | 188 | 1 | 9 | 0.1111 |
tipo | 55 | 1 | 7 | 0.1429 |
punto | 62 | 1 | 7 | 0.1429 |
segundo | 47 | 1 | 6 | 0.1667 |
Mundial | 48 | 1 | 6 | 0.1667 |
mujer | 50 | 1 | 6 | 0.1667 |
pueblo | 38 | 1 | 6 | 0.1667 |
EE | 65 | 1 | 5 | 0.2000 |
parte | 190 | 3 | 15 | 0.2000 |
cinco | 67 | 1 | 5 | 0.2000 |
estado | 44 | 1 | 5 | 0.2000 |
mí | 29 | 1 | 5 | 0.2000 |
mayoría | 57 | 1 | 5 | 0.2000 |
debe | 91 | 1 | 5 | 0.2000 |
Estados | 66 | 1 | 5 | 0.2000 |
estudio | 35 | 1 | 5 | 0.2000 |
manera | 55 | 1 | 5 | 0.2000 |
años | 359 | 8 | 37 | 0.2162 |
uno | 131 | 2 | 9 | 0.2222 |
mal | 40 | 1 | 4 | 0.2500 |
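In this subsection, we compute, for each word, the ratio of the number of right neighbors to the number of left neighbors. For example, El has 104 right neighbors and 7 left neighbors, giving a ratio of 104/7 ≈ 14.8571. Again, we look for words with extreme ratios: the first table lists the 20 words with the highest ratios, the second table the 20 words with the lowest.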
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
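The query for the first table can be tried out on a small in-memory database. The following is a minimal sketch, not the original setup: it assumes a schema `words(w_id, word, freq)` and `co_n(w1_id, w2_id)` inferred from the column names in the queries above, uses made-up toy rows, and runs on SQLite instead of the original database engine. The helper names `build_toy_db` and `neighbor_ratios` are hypothetical. The ascending variant is only a guess at how the second table might have been produced; its query is not given in the source.

```python
import sqlite3


def build_toy_db():
    """Create an in-memory database with hypothetical toy data.

    Assumed schema (inferred from the queries above, not confirmed):
      words(w_id, word, freq)
      co_n(w1_id, w2_id)  -- one row per neighbor pair: w1 directly followed by w2
    """
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER);
        CREATE TABLE co_n (w1_id INTEGER, w2_id INTEGER);
        INSERT INTO words VALUES
            (101, 'a', 50), (102, 'b', 40), (103, 'c', 30), (104, 'd', 20);
        INSERT INTO co_n VALUES
            (101, 102), (101, 103), (101, 104),
            (102, 101), (103, 104), (104, 103);
        """
    )
    return conn


def neighbor_ratios(conn, descending=True, limit=20):
    """Return (word, freq, right_cnt, left_cnt, ratio) rows sorted by ratio."""
    direction = "DESC" if descending else "ASC"
    query = f"""
        SELECT w.word, w.freq, aa.cnt, bb.cnt,
               1.0 * aa.cnt / bb.cnt AS r   -- 1.0 * avoids SQLite integer division
        FROM words w
        JOIN (SELECT w1_id, COUNT(w2_id) AS cnt
              FROM co_n WHERE w1_id > 100 GROUP BY w1_id) aa
          ON w.w_id = aa.w1_id
        JOIN (SELECT w2_id, COUNT(w1_id) AS cnt
              FROM co_n WHERE w2_id > 100 GROUP BY w2_id) bb
          ON aa.w1_id = bb.w2_id
        ORDER BY r {direction}, w.word      -- word as tie-breaker for stable output
        LIMIT ?
    """
    return conn.execute(query, (limit,)).fetchall()


conn = build_toy_db()
for row in neighbor_ratios(conn):
    print(row)
```

Because both subqueries are combined with an inner join, a word that never occurs as a left element or never as a right element in `co_n` drops out of the result, so the ratio is never computed with a zero count.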
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II